LLMs on a Shoestring: The Dynamic Cache Advantage by Arvind Sundararajan
dev.to·12h·
Discuss: DEV
💾Cache Algorithms
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·3h·
🚀Tokenizer Performance
Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)
semiengineering.com·6h
🗺️Region Inference
Topological Sort: Managing Mutable Structures in Haskell
mmhaskell.com·14h
🪢Rope Data Structures
Rowhammer: TRR on DDR5 DRAM has been broken
comsec.ethz.ch·5h·
Discuss: Hacker News
🏷️Memory Tagging
Building High-Performance Caching in Go: A Practical Guide
dev.to·21h·
Discuss: DEV
🧠Memory Models
LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·18h
🗺️Region Inference
Semantic Dictionary Encoding
falvotech.com·7h·
Discuss: Hacker News
🗂️Type Indexing
Power Query Secret Tip to Lightning-Fast Approximate Matches
geeky-gadgets.com·10h
📊Query Optimizers
Rendezvous Hashing Explained (2020)
randorithms.com·2h·
🔗Hash Algorithms
More hardware won’t fix bad engineering
infoworld.com·13h
🔮Branch Predictors
Fastest copy
forums.anandtech.com·6h
Copy Elision
What is Algebraic about Algebraic Effects?
interjectedfuture.com·6h
💫Effect Systems
Google releases VaultGemma, its first privacy-preserving LLM
arstechnica.com·1h
🎲Parser Fuzzing
What Facebook's Memcache Taught Me About Systems Thinking
lorbic.com·5h·
Discuss: Hacker News
Cache-Aware Algorithms
Building a Simple Stack-Based Virtual Machine in Go
blog.phakorn.com·15h·
📚Stack Data Structures
Explaining the LMAX Disruptor
lmax-exchange.github.io·5d·
Discuss: DEV
🎯Ring Buffers
H100 PCIe – 1.86 TB/s memcpy roofline and 8× uplift
news.ycombinator.com·1d·
Discuss: Hacker News
🧠Memory Hierarchy
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·10h·
Discuss: r/LocalLLaMA
🗺️Region Inference
Advancing Semiconductor Design: Intel’s Foveros 2.5D Packaging Technology
semiwiki.com·9h
🔧RISC-V